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PLANT PHYLOGENY! 


ABSTRACT 


Phylogeny reconstruction has become respectable science over the last few decades, and trees are accumulating 
rapidly in the literature. Botanists have been active in this effort and can already cite success stories (e.g., recognition 
of streptophytes, stomatophytes, anthophytes, eudicots). Nevertheless, only a small number of problems have been 
addressed and some of these have resisted solution. To solve the toughest problems, especially those involving ancient, 
rapid radiations, various sources of data will need to be combined, including evidence from fossils. Furthermore, in 
view of limitations in analyzing data sets with many taxa, more attention must be paid to the consequences of different 
taxon sampling strategies and to how large, variable taxa can be represented in more inclusive studies. 

Over the next few years we should continue to move toward a phylogenetic system (monophyletic groups defined 
by ancestry, diagnosed by characters), which will entail the elimination of familiar paraphyletic taxa. We can expect 
increased use of phylogenies by ecologists, molecular biologists, and others, which will force us to attend to the issue 
of the reliability of phylogenetic hypotheses, and will necessitate the development of a database of phylogenetic studies. 
Interactions with population biologists promise to be especially productive, since there are obvious mutual concerns 


centered on the analysis of gene trees and reticulation. 





Over twenty-five years have elapsed since the 
Publication of Willi Hennig’s Phylogenetic Sys- 
tematics (Hennig, 1966), and for well over a de- 
cade botanical systematists have been conducting 
numerical phylogenetic analyses. The focus of the 

uri Botanical Garden’s Annual Systematics 
Symposium on the “Origin and Relationships of 
the Major Plant Groups” suggests that progress 
has been made, and the meeting itself documented 
significant advances in our understanding of several 
of the most challenging phylogenetic questions. The 
aim of this paper is to take stock, in general terms, 
G where things stand. Has phylogenetic analysis 
ad a substantial impact on our understanding of 
plant phylogeny, and where do we go from here? 


PROGRESS 


Although many of the ideas underlying phylo- 
aeto analysis have a rather long history (see 
aw, 1992; Donoghue & Kadereit, 1992), phy- 
k reconstruction has become respectable sci- 
re only over the last few decades. This happy 
umstance can be traced to a series of devel- 
®*pments in the logic of phylogeny reconstruction 


(e.g., Hennig, 1966; Farris, 1983; see Sober, 1988; 
Swofford & Olsen, 1990), to the availability of 
computers and algorithms to implement this logic 
(e.g., Farris, 1988; Maddison & Maddison, 1992; 
Swofford, 1993), and to new sources of evidence, 
especially molecular data (e.g., Fernholm et al., 
1989; Hillis & Moritz, 1990). While it is obvious 
that these developments have had a significant 
impact on plant systematics, it is not entirely clear 
how best to measure the progress that has been 
made. There are, however, several indicators that 


bear consideration. 


LEVEL OF ACTIVITY 


One measure of progress is simply the rate of 
publication of phylogenetic hypotheses. Phyloge- 
netic trees are undoubtedly accumulating in the 
literature, but the magnitude of the effort is difficult 
to assess because no one has been keeping track 
of such information. In a survey of 79 journals 
published in 1989, 1990, and 1991, we recently 
assembled data on 1140 articles that contained 
trees (Sanderson et al., 1993). This remarkable 
number of studies (nearly a tree a day) is surely 
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FIGURE 1. Pie charts showing aspects of the taxo- 
nomic distribution of numerical phylogenetic studies pub- 
lished between 1989 and 1991 (compiled from an updated 
version of a database assembled by Sanderson et al., 
1993). Top: Studies of green plants (chlorophytes; with 
chlorophyll b, starch storage) account for ca. 23% of the 
882 phylogenetic analyses in the Sanderson et al. data- 
base. Middle: Approximately 84% of the green plant 
studies are of angiosperm groups; the remainder are of 
“other seed plants” (specifically conifers, cycads), “other 
vascular plants” (ferns, lycopsids), “other embryophytes” 
(mosses, liverworts), “other green plants” (various “green 
algae"), or included representatives of “all major lines.” 
Bottom: Of 168 angiosperm studies, 49 (ca. 29%) are of 
monocot groups; Asteraceae and Fabaceae account for 
ca. 38% of the “dicot” studies. 


an underestimate inasmuch as we did not survey 
every journal containing phylogenetic studies and 
we were unable to include analyses published in 
books. Nevertheless, our study probably provides 
a reasonably accurate picture of what exists in the 
literature. Of the total, 882 (77%) of the studies 
employed some “cladistic” method (including par- 
simony and some distance methods, but not phe- 
netic clustering methods such as UPGMA), and 


over 471 (53%) of the phylogenetic studies used 
some form of molecular evidence (including allo- 
zyme data), with the percentage of such studies 
increasing from 51% to 56% over the three-year 
period. 

We identified 312 studies of green plant groups 
(ca. 27% of the total), namely organisms with chlo- 
rophyll b and starch storage, including the various 
lines of “green algae” and the land plants (San- 
derson et al., 1993). However, 113 (36%) of these 
employed only phenetic techniques, a greater per- 
centage of phenetic studies than we found in any 
other major group of eukaryotes. A breakdown of 
the remaining 199 numerical phylogenetic studies 
of green plants (ca. 23% of all phylogenetic studies) 
is shown in Figure 1. The vast majority (168, 84%) 
of these were of angiosperm groups. Within an- 
giosperms, Asteraceae (30 phylogenetic studies) 
and Fabaceae (15 studies) received the most at- 
tention, while other large families, such as Orchi- 
daceae (4 studies) are underrepresented. The 49 
studies of monocots account for around 29% of all 
angiosperm studies, with 14 of these within grasses. 
To some extent these numbers reflect the number 
of species per group, but historical factors are 
evident as well. For example, studies of Asteraceae 
were stimulated by the early and promising rè- 
striction site studies of Jansen, Palmer, and a 
leagues (see references in Palmer et al., 1988; 
Jansen et al., 1992). 

Figure 1 also shows that some groups paced 
ceived relatively little attention, especially comes 
ering their importance in understanding phyloge- 
netic relationships among major lines of green gan 
For example, we found only five phylogenetic ie 
yses of vascular plants other than seed plants e: 
ing the three-year period, despite their great : : 
logenetic significance and the erior 

teridologists in phylogenetic questions (8-6. 
ae 1980). Such ast including fossil ear 
(e.g., ‘‘Cladoxylales,” “*Coenopteridaless Rr 
“‘progymnosperms”), are desperately need ag 
der to establish, for example, whether eusp? - 
giate fern groups are more closely rae 
tosporangiate ferns or perhaps to past he 
groups (and hence seed plants). Fortunate! T ad 
terophylls and lycophytes (including in 1992: 
receiving more attention (e.g., Bateman et ab 
Gensel, 1992). : 

Although the absolute number of pee me 
ies of green plant phylogeny increased Ee ee 
year, the percentage of such studies over vest belo 
year period fluctuated around 50%, the oasis 
based on morphology and/or secondary © 


«4c were 
Overall, about half of the molecular studies 
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based on restriction fragment variation, especially 
in chloroplast DNA. The number of plant studies 
based on nucleotide sequences has risen dramati- 
cally since 1991 (e.g., studies of the chloroplast 
gene rbcL), and will undoubtedly continue to in- 
crease in number, but rather few such studies (32, 
24% of the plant molecular studies) were published 
in the 1989-1991 period. Over half of the se- 
quence studies involved ribosomal genes (see Zim- 
mer et al., 1989; Hamby & Zimmer, 1992; Wa- 
ters et al., 1992), and in the near future, especially 
at lower taxonomic levels, analyses of the internal 
transcribed spacer (ITS) regions (e.g., Baldwin, 
1992; Wojciechowski et al., 1993) promise to be 
productive. Phylogenetic studies using “‘single- 
copy” nuclear genes are still rare, but technical 
difficulties in obtaining such sequences have been 
largely overcome, and I expect that they will play 
an increasingly important role. 

Phylogenetic studies that include fossils are still 
rare, despite their potential significance (Donoghue 
et al., 1989).While molecular studies of fossils are 
promising (e.g., Golenberg et al., 1990; Soltis et 
al., 1992), their impact will depend on developing 
the ability to repeatably obtain significant segments 
of DNA from plant parts fossilized in standard ways. 


SUCCESS STORIES 


While it is obvious that botanists have actively 
engaged in phylogenetic research, it is more dif- 
ficult to determine how successful these efforts have 
been. However, in several cases significant progress 
does appear to have been made. Some of these 
Successes are highlighted in the accompanying 
‘ymposium papers, so only a few are noted here 
(Fig. 2). 

i The studies by Mishler and colleagues (see Mish- 
a. al., 1994, this issue) have confirmed earlier 
a that green algal groups with phrag- 
dies especially charophytes such as Coleo- 
as €, are more closely related to land plants than 
n *y are to other “green algae.” They also solidify 
© view that “bryophytes” are paraphyletic, with 
ra more closely related to tracheophytes. These 
i. eses are being borne out by molecular evi- 
"nce (Mishler et al., 1992, 1994). 
aatik & Crane (1991) have made great prog- 
rea 'n establishing relationships among early lines 
a ascular plants, suggesting that several major 
far and morphological innovations pre- 
The: e radiation giving rise to extant groups. 
eir detailed studies of tracheids, coupled with 
Pa remarkable discoveries of Remy and colleagues 
Bametophyte morphology (e.g., Remy, 1982; 


Donoghue 407 


Reconstructing Plant Phylogeny 


references in Kenrick & Crane, 1991), provide 
excellent examples of the importance of fossils in 
understanding the early evolution of vascular plants. 
Again, these conclusions are consistent with recent 
molecular evidence (e.g., Palmer et al., 1988; 
Manhart & Palmer, 1990; Raubeson & Jansen, 
1992). 

Morphological phylogenetic analyses of seed 
plants (Crane, 1985, 1988; Doyle & Donoghue, 
1986, 1992; Loconte & Stevenson, 1990; Doyle 
et al., 1994, this issue; Nixon et al., 1994, this 
issue), while not agreeing in detail, have consis- 
tently concluded that Gnetales are the living group 
most closely related to angiosperms (Fig. 2). This 
clade is also supported (though not very strongly) 
by chloroplast and ribosomal sequence data (Chase 
et al., 1993; Hamby & Zimmer, 1992; but see 
Troitsky et al., 1991; Hasebe et al., 1992). An 
‘‘anthophyte” clade, consisting of these two living 
groups plus Bennetittales and Pentoxylon, is a 
regular feature of the studies that have included 
fossils. Furthermore, with the exception of some 
analyses by Nixon et al. (1994), all of these studies 
have determined that Gnetales are monophyletic, 
with Gnetum and Welwitschia more closely related 
to one another than either is to Ephedra. In this 
case, the molecular data are especially compelling 
(Doyle et al., 1994). 

There has also been consensus on the monophyly 
of angiosperms, a significant conclusion in view of 
lingering adherence to the belief that they are 
polyphyletic (e.g., Krassilov, 1991). Within angio- 
sperms there have also been promising results. 
Perhaps most importantly, the “tricolpate” clade 
of Donoghue & Doyle (1989; “eudicots” of Doyle 
& Hotton, 1991) has surfaced in recent rbcL stud- 
ies (Olmstead et al., 1992; Chase et al., 1993), 
implying that the vast majority of “dicots” (the 
“higher” subclasses of Cronquist, 1988; Takhta- 
jan, 1987) form a clade, as opposed to having 
several separate origins among “magnoliids,”” as 
suggested previously. Further analyses are needed 
to check the robustness of this conclusion. 

Even from this short list it is clear that “success” 
has not been tied to the use of any one type of 
evidence. In fact, the very impression of success 
stems from “making sense” of all of the data, 
including information on fossil and Recent organ- 
isms, morphology and molecules. In stressing this 
“criterion of veracity,” Hennig (1966) provided a 
characteristically convincing analogy, which I have 
reproduced in Figure 3. In this example, the pro- 
cess of accounting for all of the evidence is likened 
to a geographer attempting to assemble the torn 
pieces of a map so as to bring together all of the 
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ANGIOSPERMS 
(flowering plants) 


ANTHOPHYTES 
SPERMATOPHYTES (seed plants) 


LIGNOPHYTES 


TRACHEOPHYTES (vascular plants) 
POLYSPORANGIOPHYTES 
STOMATOPHYTES 
EMBRYOPHYTES (land plants) 


STREPTOPHYTES 
CHLOROPHYYTES (green plants) 
: t 
Ficure 2. An overview of green plant phylogeny (based on analyses cited in the text), showing major ee 
seem reasonably well supported and areas of continuing uncertainty (unresolved polychotomies). Many ihs are 
omitted, and resolution is simplified to highlight familiar clades (see O'Hara, 1992). Only three fossil x aeie 
included (marked by *), which are probably paraphyletic (quotation marks). Some of the best supporte ts 
have long been recognized (e.g., land plants, seed plants, flowering plants), though these have not epee tly 
accepted; others have been re-circumscribed (e.g., tracheophytes) or named (e.g., polysporangiophytes) vend hytes). 
(Kenrick & Crane, 1991); several others have not yet been named (e.g., Charales +Choleochaetales+ Embryop j 
Recognition of the monophyly of a number of terminal taxa (e.g., eudicots) is also an outcome of recent phy’ hyleti 
studies. Note that some traditional groups (e.g., “green algae,” “bryophytes,” “‘gymnosperms””) are not ee 
and are not shown. 
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roads, rivers, and other landmarks. The corre- 
sponding operation in phylogenetic systematics is 
assessing, as Hennig (1966: 130) put it, “whether 
the differently determined views concerning the 
phylogenetic relationships of different groups of 
organisms are in agreement.” Satisfaction is 
achieved when the pieces are assembled so that 
everything fits. One of the central points of the 
example is that a perfectly consistent explanation 
for any one road, or bit of evidence, might turn 
out to be highly unparsimonious when other fea- 
tures, however minor they may have seemed at 
the outset, are taken into consideration. 


OUTSTANDING PHYLOGENETIC PROBLEMS 


For each success story there are, of course, 
many other problems that have not been addressed 
or for which a satisfying result has not yet been 
obtained. Thus, as also shown in Figure 2, rela- 
tionships among the major extant lines of seed 
plants (cycads, ginkgos, conifers, anthophytes) have 
not yet been convincingly resolved (Doyle & Don- 
oghue, 1992; Doyle et al., 1994). Likewise, the 
position of the root of angiosperms, whether among 

magnoliids”” or among “‘paleoherbs,” has re- 
jan unsettled (but see Doyle et al., 1994). In 
oth cases, particular data sets fail to provide con- 
vincing support for one hypothesis over another 
and/ or there are conflicts among the results based 
on different data sets. Even in these cases, however, 
some hypotheses appear less likely to be true than 
i For example, there is little evidence that 
shit of angiosperms is among monocots, and 
roa it interesting that molecular data in particular 
ch ‘ot support rooting the tree in the vicinity of 
eee Furthermore, it is worth noting 
oo appear to be as significant among 
icy molecular trees (contrast Martin & Dowd, 
1993) amby & Zimmer, 1992; Chase et al., 
Tienes among different morphological results 
Si $ ahlgren & Bremer, 1985; Donoghue & 
à ia : 89; Loconte & Stevenson, 1991; Taylor 
ae ey, 1992; Nixon et al., 1994), as they are 

molecular and morphological results. 

e at <0 be done to solve such problems? First, 
Ait, ope to obtain more decisive data. New 
aie z evidence will certainly be forthcoming, 
hi a es sources of evidence are far from 
fa = i see Much of the information present 
tot Set Ei compendia (e.g., Cronquist, 1981) has 
This oa n incorporated in phylogenetic analyses. 
eai n and should be done, but will require a 
eit = Samira of homology hypotheses (Don- 
e & Sanderson, 1994). Furthermore, new ap- 
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Ficure 3. Figures 37 (top panel) and 38 (bottom 
panel) from Hennig (1966), illustrating the “criterion of 
veracity.” Hennig’s original captions are as follows: ““Fig- 
ure 37. Criterion of veracity. In the reconstruction of a 
topographic map from several fragments, a, a’, and a” 
could be interpreted as adjacent sections of a stream 
course. The remaining elements of the map, however, 
remain isolated. The joining of the map fragments is 
wrong.” “Figure 38. Criterion of veracity. If in the re- 
construction of a topographic map from several fragments, 
a, a’, and a” are interpreted as adjacent sections of a 
stream, the other elements of the map also join to form 
a sensible illustration. The joining of the map fragments 
is correct.” 


proaches to morphology, such as detailed studies 
of flower development and function (e.g., Endress, 
1987; Tucker, 1988; Erbar, 1991; Williams et 
al., 1993), continue to provide valuable phyloge- 
netic characters. Comparative analyses that make 
use of recent findings on the molecular mechanisms 
underlying flower development (e.g.., Coen & Mey- 
erowitz, 1991) promise to be especially useful in 
elucidating the homology of the structures that 
have been most heavily used in morphological phy- 
logenetic research. 

Different sources of evidence also can be com- 
bined and analyzed simultaneously (Kluge, 1989; 
Barrett et al., 1991; Donoghue & Sanderson, 1992; 
but see Swofford, 1991; Bull et al., 1993; de 
Queiroz, 1993). Here the hope is that weak signals 
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present in several separate data sets might com- 
plement one another and provide a single stronger 
result. Obviously, this need not be the outcome, 
but it behooves us to explore such possibilities in 
particular cases. In the case of the angiosperm 
root, the combined analysis of Doyle et al. (1994) 
favors a placement among paleoherbs more con- 
vincingly than any single data set. This is not simply 
a matter of a larger number of molecular characters 
swamping out the morphological evidence, because 
some elements in the combined trees are more 
consistent with morphology. The temptation to 
compete molecules against morphology is simply 
counterproductive—combining the evidence may 
provide the best hope of success (Sytsma, 1990; 
Donoghue & Sanderson, 1992; Doyle et al., 1994). 

Incorporation of fossils is likely to be critical in 
cases of presumed rapid radiation in the distant 
past (Donoghue et al., 1989; Huelsenbeck, 1991b; 
Novacek, 1992). Fossils might represent more 
faithfully those character combinations that were 
present closer to the splitting events of interest. 
Their addition could, in effect, “shorten” the lengths 
of branches leading to extant taxa, thereby less- 
ening the chance of spurious connections owing to 
chance convergences (the “long-branch attrac- 
tion” problem; Felsenstein, 1978). Already there 
are examples of this effect (e.g., ‘“‘progymno- 
sperms” relative to seed plants), and the addition 
of recent paleobotanical discoveries (e.g., Crane et 
al., 1989; Pedersen et al., 1991; Cornet, 1986) 
could have a similar effect as regards angiosperms 
(Doyle & Donoghue, 1993). 


METHODOLOGICAL ISSUES 


Solving difficult phylogenetic problems will re- 
quire additional evidence, but it will also depend 
on basic methodological advances. Thus, both the 
thought of combining data sets and the possibility 
of long-branch attractions highlight the need for 
more attention to taxon sampling for phylogenetic 
analysis (e.g., Donoghue & Sanderson, 1992; 
Wheeler, 1992; Kellogg & Watson, 1993). This 
is especially critical in view of the significant lim- 
itations of presently available computer algorithms 
in handling large numbers of taxa (Swofford & 
Olsen, 1990; Maddison, 1991; Maddison et al., 
1992; Penny et al., 1992). Although there is some 
comfort in the extensive sampling of angiosperms 
in the Chase et al. (1993) analysis of 499 rbcL 
sequences, the results are correspondingly difficult 
to interpret. It is unclear whether all (or even any) 
of the most parsimonious trees actually have been 
recovered, and it is practically impossible to eval- 


uate alternative hypotheses critically or to explore 
the robustness of the results using tests such as 
the bootstrap (Felsenstein, 1985; Sanderson, 1989) 
or decay analysis (Bremer, 1988; Donoghue et al., 
1992). Of course, the same limitations apply to 
large morphological studies (e.g., Hufford, 1992; 
Kellogg & Watson, 1993). 

In view of these problems, analyses involving 
large numbers of taxa must employ carefully de- 
signed (and explicit) search strategies to discover 
separate “‘islands”’ of parsimonious trees (Maddi- 
son, 1991; Maddison et al., 1992; Page, 1993b; 
Olmstead et al., 1993). Alternatively, there may 
be ways of reducing the number of taxa without 
sacrificing accuracy. One possibility, the use of a 
single “exemplar” species to represent a large clade, 
has the advantage of avoiding (at least partially) 
the problem of inferring basal states for the group 
in question. This strategy, however, entails its own 
risks. Most importantly, the inadvertent choice of 
an exemplar that is, in reality, nested well within 
the clade in question, might result in a mistaken 
estimate of the true relationships of the group. This 
could happen, for example, if the exemplar be- 
longed to one specialized line within the group. 
which happened to have converged on another 
terminal taxon in the analysis. p 

A second option is the use of a “placeholder: 
for a number of terminal taxa, where there * 
evidence at the outset that these form a clade. in 
this case, some prior information on relationships 
within the group may make it possible to establish 
a set of ancestral conditions, by finding the Reet 
parsimonious character states at the basal a 
the clade in question (algorithms are described ji 
Swofford & Maddison, 1987). This approach, # 
vocated by Mishler (1994) under the name oe. 

Se kere ge 5 hological 
partmentalization,”’ has been used in morp 
analyses for some time (though not always oooh 
explicitly). For example, Donoghue & Doyle (I ys 
performed a preliminary phylogenetic ener 
within a presumably monophyletic core- wer 
clade and used the results to assign states if 
placeholder for that line. Similarly, hr 
(1994) used preliminary hypotheses on re ee 
ships within such groups as monocotyledons to" 
down” information on a set of observed term T 
taxa to a set of ancestral state assignments. *- 
tunately, it may be sufficient to have = eae of 
prior hypothesis of relationships, as = wat 
taxa nested well within the clade in question 


be effectively “‘screened-off from having any f 
sal conditions ( 


that more 


fect on the assessment of ba 
dison et al., 1984). . 
The use of placeholders will mean 
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than one character state will occur in some terminal 
taxa. An alternative approach, advocated by Nixon 
& Davis (1991), is always to subdivide polymorphic 
taxa into monomorphic terminal units. This, of 
course, can only aggravate the real-world problems 
posed by large numbers of taxa. Furthermore, sub- 
division based on the states of a character (one of 
which is presumably ancestral within the group) 
will result in para- or polyphyletic terminal units 
in many cases. For example, if angiosperms were 
subdivided into two terminal taxa based on the 
presence and absence of vessels it is highly likely 
that neither would be a clade. The same would be 
true of many other characters, such as the presence 
and absence of an active cambium, or different 
types of embryo sac development. Inclusion of 
nonmonophyletic terminal taxa in an analysis can 
result in erroneous conclusions about relationships 
and character evolution, at least if taxa are inter- 
preted in the standard way, namely as representing 
real (single) branches of a phylogenetic tree. 

In the end, the best method of coding large 
polymorphic taxa will depend on the circumstanc- 
es, Sometimes it will be feasible and appropriate 
to split a polymorphic terminal unit into two or 
more groups (hopefully monophyletic), and this 
may not significantly increase the overall number 
of taxa. In other cases some form of compart- 
mentalization will be possible, leaning cautiously on 
the results of previous analyses. The latter requires 
that the assumptions used in one analysis be vig- 
orously tested in other, independent analyses, but 
such tests are certainly possible (e.g., Doyle et al., 
1994). Another option is to simply code a terminal 
Si as polymorphic, as can be done in PAUP 
( Swofford, 1993) and MacClade (Maddison & Mad- 
dison, 1992). In the case of multistate characters 
=< as nucleotides at a particular site) this need 
agi be equivalent to coding the taxon as completely 
poms: since particular subsets of states can 

Specified. Although coding a taxon as unknown 
Sometimes have undesirable effects (Doyle & 
> hae 1986; Nixon & Davis, 1991; Platnick 
ib š 991), it may be necessary, especially to 
di Fporate incomplete fossils (Maddison & Mad- 

ison, 1992), 

Even in cases that continue to resist resolution, 

vanes is made when relevant data and analyses 
re published, because, if nothing else, this clarifies 

meee of alternative hypotheses. This is 
rg = course, only to the extent that sources of 
mare (especially voucher specimens) are 
once y documented, and only if the assumptions 
e i ying the delimitation of taxa and characters 
clearly spelled out. Phylogenetic research can 


be of lasting value only if all of the evidence un- 
derlying the results is made available for scrutiny. 
Otherwise, it is impossible to make reasoned choices 
among alternative hypotheses, or to learn anything 
at all from experience. 


PROSPECTS 


What can we look forward to over the next few 
years? Here I will briefly highlight what I believe 
to be several important trends and their likely con- 
sequences. At the base of each of these trends is 
a more fundamental shift to what O’Hara (1988) 
has called “‘tree-thinking”’: understanding diversity 
as the product of an underlying branching process, 
and observed differences as having resulted from 
evolutionary changes along the branches of a tree. 
That the shift away from a linear view of evolution 
is far from complete is evidenced by the language 
still so often used to describe evolutionary history, 
such as “‘lower” and “‘higher” in reference to taxa 
(O’Hara, 1992). Fortunately, however, tree-think- 
ing is likely to figure more prominently in the 
curriculum, with software such as MacClade (Mad- 
dison & Maddison, 1992) making it easy and fun 
to explore the implications of evolutionary trees. 


A PHYLOGENETIC SYSTEM 


Although major changes will undoubtedly take 
time, I suspect that botanists will continue to move 
toward a truly phylogenetic system. By this I mean 
that taxa will be delimited so as to conform to our 
best estimates of phylogeny and will be defined in 
terms of ancestry rather than in terms of the char- 
acters used to diagnose them (Hennig, 1966; de 
Queiroz, 1988, 1992; Donoghue & Cantino, 1988; 
de Queiroz & Gauthier, 1992). In practical terms 
this will mean eliminating some familiar paraphy- 
letic groups. An excellent example is provided by 
phylogenetic research on Asteraceae, culminating 
recently (Bremer et al., 1992) in several new names 
to reflect conclusions that are well supported by 
different sources of evidence (e.g., Barnadesioideae 
as the sister group of the rest of the family). 

Many more changes of this sort will be made as 
our understanding of phylogeny improves. In par- 
ticular, it is clear that some traditional angiosperm 
families are paraphyletic, with one or more mono- 
phyletic families nested within them. For example, 
Cantino (1992) has carefully documented the or- 
igin of several lines of Lamiaceae from within Ver- 
benaceae. Judd et al. (1994) have documented 
similar situations in phylogenetic analyses of other 
presumably closely related families. In particular, 
Asclepiadaceae appear to be nested within a para- 
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phyletic Apocynaceae, Apiaceae within Araliaceae, 
Brassicaceae within Capparaceae, Valerianaceae 
plus Dipsacaceae within Caprifoliaceae, Urticaceae 
within Moraceae, and Aceraceae and Hippocas- 
tanaceae within Sapindaceae. Many of these con- 
clusions based on morphology are now also sup- 
ported by molecular evidence (e.g., Chase et al., 
1993). In most of these cases, we have proposed 
to eliminate paraphyly through recognition of a 
single more inclusive family (Judd et al., 1994). 
This, we hope, will help counteract a bias on the 
part of temperate botanists to segregate and elevate 
the rank of mainly temperate, mainly herbaceous 
groups, leaving behind a residue of woody tropical 
plants with which they are less familiar. 

Although the shift to monophyly may entail rath- 
er major changes in some cases, this should hardly 
be viewed with dismay. Instead, such changes are 
a concrete sign that we are making progress in 
understanding phylogeny. Furthermore, although 
there will be resistance to changing traditional 
names, such changes are very likely to occur in 
the long run because a phylogenetic system is most 
useful in helping us understand evolutionary and 
ecological processes, biogeography, and so on 
(Hennig, 1966). Moreover, recognition of mono- 
phyletic groups makes it much easier to teach plant 
diversity and for students to learn about it. It also 
helps from the standpoint of continued analysis of 
phylogeny to have names for the clades we work 
with. The name “‘anthophyte,” for example, has 
been useful in describing seed plant phylogeny. 
Likewise, we hope that the “‘stem-based” name 
“‘angiophyte” (Doyle & Donoghue, 1993) will fa- 
cilitate discussion of how fossils such as Sanmi- 
guelia (Cornet, 1986) relate to modern angio- 
sperms. 

In addition to better phylogenies, the develop- 
ment of a phylogenetic system requires attention 
to the logic of phylogenetic taxonomy and nomen- 
clatural conventions. Here the analysis and rec- 
ommendations of de Queiroz & Gauthier (1992) 
provide an excellent starting point. Among other 
things, I agree with them that we should seriously 
reconsider the need to designate standard Linnean 
ranks, especially in view of the evident temptation 
to treat taxa assigned to the same taxonomic rank 
as though they are somehow equivalent. Such rank- 
based approaches can lead, for instance, to sys- 
tematic errors in studies of rates of extinction and 


diversification (Doyle & Donoghue, 1993). 


PHYLOGENIES IN USE 


The use of phylogenies by ecologists, paleon- 
tologists, biogeographers, molecular biologists, con- 


servationists, and others will surely continue to 
increase (e.g., Brooks & McLennan, 1991; Harvey 
& Pagel, 1991). In the case of angiosperms, for 
example, the availability of trees has already had 
an impact on studies of a wide variety of characters, 
including vessels (Young, 1981; Donoghue & Doyle, 
1989), dioecy (Donoghue, 1989), double fertiliza- 
tion and endosperm (Donoghue, 1989; Freidman, 
1992; Donoghue & Scheiner, 1992), pollen mor- 
phology (Doyle & Hotton, 1991), pollination (Pell- 
myr, 1992), and self-incompatibility (Weller et al., 
1994). Many other ideas on the direction of mor- 
phological evolution (discussed, for example, by 
Stebbins, 1974; Carlquist, 1975; Cronquist, 1988; 
Takhtajan, 1991) will soon be tested in a phylo- 
genetic context. Similarly, phylogenies will have 
an increasing impact on our understanding of mo- 
lecular evolution. For instance, we can look forward 
to a fruitful period of “reciprocal illumination” 
between studies of plant phylogeny and molecular 
genetic analyses of the mechanisms underlying 
flower development (Coen & Meyerowitz, 1991). 
The use of phylogenies in other studies will force 
phylogenetic systematists to attend to the issue of 
reliability. Quite naturally, those who intend to rely 
on phylogenies will want some indication as to how 
believable they are. A variety of methods have 
been developed (e.g., Felsenstein, 1985; Bremer, 
1988; Sanderson, 1989; Archie, 1989; Hillis 
1991; Huelsenbeck, 1991a; Goloboff, 199 1a; 
Faith, 1991; Donoghue et al., 1992) and there is 
a growing (mostly critical) literature on the logic 
and behavior of such methods (e.g., Carpenter, 
1992; Goloboff, 1991b; Kallersjo et al., 1992; 
Hillis & Bull, 1993). Here it will be critical to 
develop a deeper understanding of the underlying 
philosophical and statistical issues. If it emerges 
that particular methods are unsatisfactory, then it 
behooves us to articulate exactly why this i$ the 
case and then to develop and test better methods. 
In the meantime, users of trees should be wa 
of uncertainties and encouraged to take into 4° 
count ambiguities in estimates of the panes 
and of character change (e.g., Maddison, 1991; 
Maddison et al., 1992; Weller et al., in press). 4 
Progress in assessing confidence will require 
better understanding of possible sources of a : 
such as long-branch attraction (e.g-, Albert ot is 
1994). In particular, we need to know pease 
cisely how to distinguish real signal from T 
results. Despite their limitations (Hillis et al., im 
computer simulations will continue to be a | 
approach to such questions, making it 
explore a very wide range of possible pre 
values (e.g., Lanyon, 1988; Huelsenbeck & Mih 
1993; Kim et al., 1993). Of course, such s 
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leave open the critical question of exactly where 
in parameter space any particular problem is sit- 
uated, but it is nevertheless important to formalize 
the universe of possibilities and the impact of crit- 
ical variables. 

Work with known phylogenies (e.g., Atchley & 
Fitch, 1991; Hillis et al., 1992) and experimen- 
tation with real data sets (e.g., Smith, 1989; Don- 
oghue et al., 1989; Allard & Miyamoto, 1992) 
also will be useful. An approach that deserves more 
attention is the use of random nucleotide sequences 
as benchmarks to help determine whether real se- 
quences have retained signal relative to a particular 
phylogenetic problem (e.g., Miyamoto and Boyle, 
1989; Wheeler, 1990; Maddison et al., 1992). 
For example, David Maddison and I (unpublished 
analysis) are using this approach to evaluate the 
attachment of real outgroup sequences to the Cer- 
atophyllum branch in angiosperm trees based on 
rbcL data (Les et al., 1991; Chase et al., 1993). 
Is this the true position of the root, or are real 
outgroup sequences so highly diverged that, in 
effect, they no longer retain phylogenetic infor- 
mation? Our preliminary studies show that most 

random” sequences (generated under several 
iar also attach to the “long” Ceratophyllum 
en Nevertheless, there are some aspects of 
he behavior of real outgroup sequences that de- 
ee from any of the random sequences we have 
ce ae implying that the real sequences may 
peek entirely devoid of phylogenetic signal. More 
‘Xperiments are needed, as are more sensitive sta- 
tistical tests. 
he ec interest in phylogenies also demands 
oer better ways of making information 
“eta, m accessible to potential users. We need, 
re “ie to assemble a database of phylogenetic 
“lots trees (Sanderson et al., 1993). At the 
oe We should establish standard means for 
p ee Riyiogenptie data as a co-requisite of 
sie Be just as nucleotide sequences are now 
mak oe ansmitted electronically to repositories 
Seia rage (Cinkosky et al., 1991). On a 
ean. evel, such a database would allow sys- 
ie anche rapidly to requests for phy- 
with ee in connection, for example, 
1991) M ation efforts (e.g., Vane-Wright et al., 
Ma. aai] of the full range of 
Sia e c studies surely would facilitate the 
Kia r ‘ats patterns; for example, patterns 
ghue, Per homoplasy (e.g., Sanderson & Dono- 
chee #3 or in tree shape (e.g., Savage, 1983; 
cea: owinski, 1991, 1993). It would also 
geogra ee search for general patterns in 
sraphie distributions, by allowing (at least po- 


ten 
tially) ready access to every tree with taxa pres- 
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ent in particular areas of endemism (e.g., Nelson 


& Platnick, 1981; Page, 1991). 


PHYLOGENIES AND POPULATION BIOLOGY 


Systematics is sometimes portrayed as being 
neatly separated from population biology, and it is 
sometimes implied that this gap should be main- 
tained (e.g., Nixon & Wheeler, 1990). This, I 
believe, is a mistake. On the contrary, interactions 
with population biologists should be promoted ac- 
tively. Such contacts promise to be especially fruit- 
ful in view of obvious mutual concerns centered 
on the analysis of gene trees and the recent emer- 
gence of coalescence theory (Pamilo & Nei, 1988; 
Avise, 1989; Takahata, 1989; Hudson, 1990; 
Doyle, 1992; Page, 1993a; Baum & Shaw, in 
press; Maddison, in press). 

The application of phylogenetic methods to what 
have traditionally been population-level problems 
is yielding promising results already. For example, 
Slatkin & Maddison (1990) devised a phylogenetic 
measure of gene flow, and Felsenstein (1992) has 
approached the estimation of effective population 
size from a phylogenetic standpoint. It is perhaps 
less obvious how systematists, especially those of 
us primarily concerned with deeper branching 
events, stand to benefit from interacting with pop- 
ulation biologists. However, several fundamental 
questions require population-level input. One such 
concern is the possibility of a false estimate of 
phylogeny based on uniparentally inherited organ- 
ellar DNA (e.g., Rieseberg & Soltis, 1991; Doyle, 
1992). Whether lineage sorting, for example, is 
likely to have been an important factor in a par- 
ticular case depends on population sizes and the 
time intervals between splitting events. We rec- 
ognize that these are the important parameters, 
and we know something about their critical values, 
based on models developed by population geneti- 
cists (e.g., Pamilo & Nei, 1988; see Maddison, in 
press). Moreover, population biologists may be able 
to estimate these parameters in particular cases. 
Ultimately, this could bear directly on the way in 
which organellar data are integrated with other 
evidence in phylogenetic analysis (Doyle, 1992). 

Input from population geneticists also will be 
critical in understanding processes such as con- 
certed evolution and the dynamics underlying the 
establishment of paralogous genes, issues of obvious 
interest to molecular systematists (e.g., Zimmer et 
al., 1980; Patterson, 1988; Sanderson & Doyle, 
1992). Better understanding of these phenomena 
will help guard against mistaken comparisons, and 
will allow us to turn them to our advantage in some 
cases. For example, comparison of phylogenies 
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based on paralogous genes may make it possible 
to establish the root of a tree. In fact, this approach 
allowed Iwabe et al. (1989) to root the tree of life, 
despite the lack of obvious outgroups. 

The ability to compare trees based on both nu- 
clear and organellar genes obviously has expanded 
our ability to identify hybrids and specify their 
parentage (e.g., Soltis et al., 1989; Rieseberg & 
Brunsfeld, 1992). Although hybrids have occa- 
sionally been discussed by phylogeneticists (papers 
in Platnick & Funk, 1983; Funk, 1985), detailed 
empirical studies of their impact on phylogeny re- 
construction have appeared only recently (Mc- 
Dade, 1992). Nevertheless, botanists should soon 
play a major role in developing methods for the 
recognition and incorporation of reticulations in 
phylogenetic analysis (e.g., Rieseberg & Morefield, 
in press). Ultimately, what is needed is a general 
theory covering reticulations of all sorts (Page, 
1993a), including recombination in the case of 
genes (Hein, 1990), lateral transfer of DNA (Val- 
dez & Pinero, 1992), and even the fusion of entire 
biotas in biogeography (Cracraft, 1988). The de- 
velopment of such a theory depends on breaking 
down whatever barriers have tended to separate 
population biologists from systematists. 


SUMMARY 


It seems clear that headway has been made in 
understanding plant phylogeny over the last few 
decades, and there is every reason to expect con- 
tinued success. It is also clear, however, that many 
basic phylogenetic questions will be rather difficult 
to answer. New morphological and molecular ev- 
idence, in combination with the old, should even- 
tually point the way to a solution. At the same 
time, methodological advances will allow us to get 
more mileage out of whatever data are available. 

Better phylogenies will lead (hopefully) to tax- 
onomic changes, and will surely encourage the use 
of trees outside of systematics. One result will be 
increased attention to problems of taxon sampling 
and to methods for assessing the reliability of phy- 
logenetic hypotheses. Eventually, we will see the 
development of an even more comprehensive phy- 
logenetic theory, incorporating reticulation of all 
kinds and transcending the boundaries of tradi- 
tional disciplines. 
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